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INTRODUCTION 


This  paper  concerns  a  statistical  problem  in  estimating  relationships 
between  resources  and  readiness.  In  policy  language,  the  question  is  which 
term,  resources  or  readiness,  is  the  discretionary,  or  causal  variable,  and 
which  is  the  determined,  or  effect  variable.  In  statistical  terms,  the  question 
is  which  variable  to  put  on  the  left-hand  side  of  a  statistical  regression 
relationship.  Although  this  issue  could  arise  in  studying  any  area  of  the 
budget,  we  will  discuss  it  in  the  context  of  the  Naval  shore  establishment. 
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RELATING  RESOURCES  TO  READINESS  FOR  NAVAL 
SHORE  BASES 


To  help  manage  the  shore  establishmenVthe  Navy  has  been  taking  steps 
to  relate  resources  to  readiness  for  its  bases/ Such  relationships  can  be  useful 
in  determining  the  cost  of  bringing  deficient  bases  up  to  a  desired  readiness 
level,  or  allocating  a  given  BOS  budget  across  bases  in  order  to  achieve  a 
uniform  state  of  readiness. 
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This  paper  is  concerned  with  the  analytical  methodology  for  relating 
readiness  to  spending.  We  will  assume  the  Navy  has  managed  to  obtain 
reasonable  measures  of  readiness. 


The  question,  therefore,  is  how  estimating  the  relationship  between  the 
readiness  ( R )  and  BOS  cost  (C)  of  a  base  (in  either  a  cross-section  or  a  time- 
series  analysis.) 


MODEL 


To  obtain  causal  relationships  that  can  be  applied  to  policy  questions;  we 
first  need  a  model  that  describes  our  best  understanding  of  the  underlying 
mechanisms.  We  will  assume  that  the  Navy’s  management  of  the  shore 
establishment  can  be  described  in  the  aggregate  by  the  following  two-phase 
model.  The  Navy  is  given  a  total  budget  based  on  broad  objectives  ("We  need  a 
600-ship  Navy”),  and  the  CNO  divides  up  the  budget  based  on  requests  for 
support  by  the  sponsors  of  the  shore  establishment  and  the  other  activities 
that  comprise  the  Navy.  How  much  is  given  to  each  base  (C)  is  based  on 
several  types  of  variables:  the  installation’s  current  readiness  ( R ),  its  physical 
and  personnel  characteristics  (building  area,  number  of  military  people,  etc.) 
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represented  by  a  size  variable  (S),  a  host  of  other  factors  that  enter  the 
allocation  of  the  Navy  budget  ( Z ),  and  finally  an  error  term  (u): 

C  =  a0  +  axR  +  a2S  +  a2Z  +  u  .  (1) 

In  the  "spending”  phase  of  the  management  process,  each  base  spends  its 
budget  in  such  a  way  as  to  maximize  readiness.  The  readiness  ( R )  a  base 
actually  achieves  depends  not  only  on  its  budget,  but  also  on  its 
characteristics  (S),  a  variety  of  other  factors  ( W),  and  finally  an  error  term  (v): 

R  —  b0+  fcjC  +  b2S  +  b2W  +  v  .  (2) 

This  is  a  simultaneous  equation  model  with  no  specification  as  to 
dynamics— how  the  "allocation”  and  "spending”  phases  occur  in  time.  In 
reality,  what  the  Pentagon  gives  a  base  to  spend  at  the  start  of  a  given  fiscal 
year  depends  on  how  much  readiness  the  base  has  already  achieved  as  a  result 
of  prior  years’  allocations.  A  base,  moreover,  can  receive  budgetary  adjust¬ 
ments  during  the  year  to  handle  unforseen  problems  that  may  arise.  (It  is 
possible  that  the  problem  discussed  in  this  paper  would  not  arise  in  a  time- 
dependent  analysis  with  small  enough  time  intervals.  Simultaneous  models 
are  nevertheless  necessary  because  such  detailed  time-series  data  are 
typically  unavailable.) 

Estimating  the  full  two-step  model  (equations  1  and  2)  as  it  stands  is  out 
of  the  question,  at  least  at  present.  It  would  be  a  major  project  to  define  and 
measure  the  "other  factors”  (Z)  that  strongly  determine  how  much  BOS  funds 
the  Navy  gives  to  each  base. 

ESTIMATING  THE  "BASE  SPENDING”  EQUATION 

It  is  not  out  of  the  question,  however,  to  estimate  equation  2,  the  "base 
spending”  equation.  Data  on  cost  (C),  readiness  ( R ),  and  various  size  variables 
(S)  are  available  from  the  budget,  the  new  BaseRep  system,  and  other  sources. 
We  would  need  to  omit  the  "other  factors”  (W)  and  see  if  the  explanatory 
power  is  still  satisfactory.  "Other  factors”  are  intuitively  less  important  in 
determining  the  readiness  achieved  with  a  given  budget  (equation  2)  than  in 
determining  the  budget  itself  (equation  1). 

There  is,  however,  the  question  of  how  to  estimate  the  base  spending 
equation,  equation  2  (without  the  W).  We  are  not  concerned,  here,  with 
questions  like  whether  to  use  a  linear  or  log-linear  specification,  but  rather 
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the  more  fundamental  question  of  which  variable,  C  or  12,  to  treat  as 
endogenous— to  put  on  the  left-hand  side.  The  fact  that  equation  2  has  R  on 
the  left-hand  side  does  not  imply  that  the  equation  must  be  estimated  this 
way.  One  could  instead  estimate: 

C  =  d0  +  dxR  +  d2S  +  e  .  (3) 

The  choice  of  whether  to  estimate  equation  2  (without  the  W)  or  3  is 
called  the  "normalization”  problem  in  the  statistics  literature.  Roger  Klein 
shows  that  the  choice  should  reflect  the  criterion  for  ordinary  least  squares: 
right-hand  variables  should  be  uncorrelated  with  the  error  term.  The  full 
model  implies,  of  course,  that  neither  C  nor  R  is  rigorously  uncorrelated  with 
y;  v  affects  R  through  equation  2,  and  thereby  C  through  equation  1.  The 
question  is  which  variable,  C  or  12,  is  less  correlated  with  v. 

One  way  to  decide  this  is  to  take  the  full  model  and  solve  the  equations 
for  C  and  12;  then  calculate  the  correlation  coefficients  p2  (u,  C)  and  p2  (u,  12)  in 
terms  of  the  unknown  coefficients  of  the  model;  and  finally,  use  our  intuition 
about  these  coefficients  to  judge  which  p2  term  is  larger.  If  p2  (u,  C)  <  p2  (u,  R) 
then  C  is  more  exogenous  (less  dependent  on  v)  and  should  be  put  on  the  right- 
hand  side.  We  would  thus  estimate  equation  2  (without  the  W).  In  the 
opposite  case,  we  would  estimate  equation  3. 

We  can  do  the  same  thing  less  formally.  Suppose  we  assume,  for 
example,  that  ax  in  equation  1  is  small  —  that  the  principal  determinant  of  a 
base’s  budget  is  its  size,  with  readiness  being  an  adjustment  to  this  basic 
relationship.  In  this  case,  equation  2  would  be  the  one  that  should  be 
estimated.  Look  at  the  full  model  and  assume  that  v  changes.  This  affects  R 
by  equation  2  and  thereby  C  by  equation  1.  The  last  link  is  weak,  however, 
given  our  assumption  that  at  is  small.  The  fact  that  v  has  little  effect  on  C 
means  that  C  is  more  exogenous  than  R,  and  should  thus  be  on  the  right-hand 
side.  The  question  of  just  how  small  aj  must  be  is  derived  in  the  last  section. 

Alternatively,  if  we  assume  a2  is  small  — that  bases  with  low  readiness 
get  much  more  funding  and  size  is  now  the  adjustment— then  v  and  C  are 
closely  linked  and  equation  3  is  the  one  that  should  be  estimated. 


Note  that  choosing  an  underlying  model  is  the  only  way  to  select  which 
equation  to  estimate.  Looking  only  at  goodness  of  fit  is  inappropriate,  and 
sometimes  misleading.  For  example,  taking  aY  small,  which  leads  to  putting 
R  on  the  left-hand  side,  also  suggests  that  putting  C  on  the  left-hand  side 
would  yield  "better”  statistical  results.  If  the  effect  of  S  on  C  is  the  dominant 
relationship,  one  would  obtain  a  higher  R2  from  putting  C,  rather  than  R  on 
the  left  in  equation  2.  This  would  also  avoid  multicollinearity  problems 
between  C  and  S.  These  are  not,  however,  the  appropriate  criteria  for 
choosing  between  models  with  different  variables.  Theory  is  the  correct 
criterion. 

"INVERTED  PRODUCTION  FUNCTION”  MODEL 

There  is  an  alternative  approach  to  relating  BOS  cost  to  readiness.  The 
technique  involves  inverting  the  base’s  production  function. 

Regard  the  base  characteristics  (S)  as  outputs  of  the  BOS  resources.  The 
square  feet  of  floor  space,  for  example,  is  one  measure  of  output  maintenance 
resources  and  utilities  the  base  employs,  holding  readiness  constant.  More 
generally,  the  base  production  function  relates  output  to  the  factors  of  capital 
and  labor  employed  (readiness  held  constant): 

S  =  f(KJL)  =  MKaLb  •  (4) 

Next  is  the  cost  of  capital  and  labor  at  given  prices  ("r”  for  implicit  rent, 
"iy”  for  wage  rate): 

C  =  rK  -!-  wL  .  (5) 

The  next  step  is  to  minimize  cost  subject  to  a  constraint  on  output,  S. 
The  Lagrangian  method  gives  the  solution1: 


(J_)  (JL_)  (JL\ 

_  TTQ\a+6/  \ n+6/  \a+6/ 


where 


C  =  HS 


ff=(a  +  6)[-J-(-)a(7)6]Wa+6)  . 
M  a  b 


1.  Hans  Brems,  Quantitative  Economic  Theory,  Wiley  1968.  (HB-74-M3-B735),  p.  78. 
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BOS  funding  level  is  thus  a  function  of  the  workload  and  the  coefficients 
of  the  production  function  (still  holding  readiness  constant). 

Since  we  might  use  a  variety  of  workload  measures,  and  since  the 
coefficients  of  the  production  function  are  unknown,  we  end  up  with  a  cost 
function  to  be  estimated  by  regression  analysis: 

C  —  d0  +  djSj  +  d2S2  +  ...  +  e  .  (7) 

The  final  step  is  to  recognize  that  we  would  obtain  different  cost 
functions  for  different  levels  of  readiness.  A  million  square  feet  can  be  main¬ 
tained  in  good  shape  for  $X  million,  and  in  better  shape  for  $(X+ A)  million. 
We  can  reflect  this  dependence  by  expanding  the  cost  function  to  include 
readiness  as  an  explicit  variable  (i.e.,  readiness  as  simply  another  output): 

C  —  d0  +  +  d2S2  +  •••  dfiR  +  e  •  (8) 

Even  under  this  alternate  specification,  there  is  still  the  basic  issue  of 
which  variable  belongs  on  the  left  hand  side  and  which  belongs  on  the  right- 
hand  side.1 

SPECIFICATION  OF  THE  "BASE  SPENDING”  EQUATION: 
RIGOROUS  TREATMENT 

In  this  section  we  compare  the  two  versions  of  the  "base  spending” 
equation  to  be  estimated  in  terms  of  formal  statistical  properties: 

R  —  b0  +  6jC  +  b2S  +  v  (9) 

C  =  d0  +  d^R  +  d2S  +  e  .  (10) 

Note  that  equations  9  and  10  are  exactly  the  same  equation  except  that  the 
variables  C,  R  have  been  interchanged.  The  parameters  in  equations  9  and  10 
are  obviously  closely  related.  For  example,  bx  —  l/dr 

1.  There  is,  of  course,  a  mathematical  duality  between  maximizing  output  subject  to  a  cost 
constraint  and  minimizing  cost  subject  to  an  output  constraint.  The  duality  does  not  carry 
over,  however,  to  statistics:  V-  icause  thr  error  term  is  not  related  equally  to  the  left-hand 
and  right-hand  variables,  it  does  matter  for  normalization  whether  cost  or  output  is  put  on 
the  right-hand-side. 


The  question  for  analysis  is  which  version  ia  preferred  on  statistical 
grounds. 

To  compare  equations  9  and  10,  they  must  be  combined  with  the  other 
equation  in  the  system,  the  reaction  function.  We  will  analyze  equation  9 
along  with  equation  1  from  the  previous  section. 

The  system  to  be  analyzed  is  thus: 


—  a0  +  aft  +  a2S  +  a2  Z  +  u 

(1) 

R  —  b0  +  b^C  +  b2S  +  v  . 

(9) 

The  question  of  whether  to  estimate  equation  9  or  1C  comes  down  to  the 
question  of  whether  R  or  C  is  less  correlated  with  v  and  thus  will  cause  the 
least  damage  on  the  righ"  hand  side.  More  formally,  Klein1  shows  that  (aside 
from  a  term  that  vanishes  as  a*  -»  0)  the  mean  square  errors  (MSE)  of  the 
parameter  estimates  of  60,  bv  b2  will  be  minimized  by  putting  on  the  right- 
hand  side  the  variable  with  the  lesser  correlation  with  the  right-hand  side 
error  term,  v.  Note  that  the  MSE  criterion  depends  on  the  initial  norm¬ 
alization,  i.e.,  the  analyst  must  decide  whether  he  is  interested  in  the  MSE  of 
the  parameters  of  the  production  function  or  the  cost  function.  The  optimal 
normalization  seems  likely  to  be  the  same  regardless  of  the  initial  norm¬ 
alization,  but  this  is  not  guaranteed.  We  have  chosen  the  initial  normaliza¬ 
tion  with  R  as  the  dependent  variable,  since  this  is  the  form  with  the  policy 
relevance  (the  policy  issu_  is  the  effect  of  C  on  R). 

We  now  turn  to  determining  which  variable  R  or  C  has  the  lower  cor¬ 
relation  with  v.  We  start  by  rewriting  the  system  in  a  more  compact  notation: 

f  +  Ay  +  Bx  +  e  =  0  , 


1.  Roger  W.  Klein,  "K-Class  Estimators:  The  Optimum  Normalization  for  Finite  Samples,” 
Journal  of  the  American  Statistical  Association,  June  1973,  Vol.  68,  No.  342,  pp.  445-451. 
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Define  G  =  (—A)~l  with  typical  element g  : 


With  this  notation 


G  =  P“  H 


S21  822 j 


—  *i  r*  * 

g  a  gi2  ®2(u)  0 

E(yz')  = 

*21  £22J  L°  °2(y>. 


rvi 


mi 


o(C,u)  gi2o2(v) 

The  second  column  of  E(yz')  is  = 

o(#,o}J  L^22a2^L 

R  will  be  preferred  as  the  left-hand  side  variable  provided  that  pHR,v)  > 
p2(C,u).  Klein  shows  that  an  equivalent  condition  is  that: 
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1  +  2 bl  cov  ( m2 ,  0*)  >.  0 

where  v*  =  —  and  m2*  =  m^Joiv) and  m2  is  the  residual  in  the  reduced  form 
o(u) 

equation  for  C. 

Thus,  the  condition  is  rewritten 


1  +  — —  cov  On  v)  >  0 
a2(v) 


=  l  +  — —  cov  (C,  v)  ~z  9 


W  1 

=  1  +  "r:*i2o2a,)-0 

O  (u) 


=  1  +  2fej  , 


m ® 


$rrT- 
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At  this  point  (to  evaluate  gl2)  we  need  to  evaluate  the  matrix  G  =  ( -  A)~ 1 . 


so  thatg12  =  (1  —  at  f^)-1  ax  . 

The  condition  for  R  to  be  the  preferred  left-hand  variable  is  now: 

1  +  26^  >0 
1  +  26^(1  —  a^)-1  0 

or 


This  condition  is  satisfied  so  long  as  |ol6l|<l.  If  al  and  bx  are  positive, 
the  requirement  (for  R  to  be  the  left-hand  variable)  is  that  ax  be  small  in  the 

1 

sense  that  a.  <  — 
f>i 

Graphical  Analysis 

The  conditions  we  have  derived  can  be  illustrated  graphically.  The  solid 
line  is  the  production  function  for  readiness,  the  dotted  line  the  reaction 
function. 


R 

Fif.  1:  Graphical  Analysis  of  the  Normalization  Condition 

Suppose  1  corresponds  to  v  =  0.  When  v  takes  a  positive  value  1  shifts 
to  2.  Because  of  this  increase  in  v,  both  R  and  C  increase,  i.e.,  they  both  have  a 
positive  correlation  with  the  error  term.  The  correlation  of  R  and  v  will  be 
large  relative  to  that  between  C  and  v  if  the  dotted  line  is  flat.  If  the  dotted 
line  is  horizontal,  C  does  not  vary  at  all,  i.e.,  it  is  uncorrelated  with  v. 

"What  the  figure  indicates  is  that  a  low  value  of  al  (the  slope  of  the  dotted 
line),  leads  to  R  being  the  more  endogenous  variable  and  thus  the  preferred 
left-hand  variable. 

Summary 

The  decision  of  which  equation  to  estimate,  (2)  or  (3) -the  "normal¬ 
ization”  question)— depends  on  whether  C  or  R  is  more  correlated  with  the 
error  term,  not  on  the  value  ofR2. 

Theoretically,  either  normalization  is  possible. 

In  practice,  we  suspect  that  bx  is  small:  a  base’s  readiness  is  a  function  of 
many  years  past  spending  and  extra  funds  this  year  will  not  make  a  big 
change.  If  6j  is  small,  then  \lbx  is  large.  Further,  a.t  is  likely  to  be  small:  a 
base’s  budget  is  not  heavily  dependent  on  its  readiness.  Thus  ax  is  likely  to  be 
less  than  1/61  so  thatR  is  the  appropriate  left-hand-side  variable. 
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